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Abstract 

Signatures of universality are detected by comparing individual eigenvalue distributions and level 
spacings from financial covariance matrices to random matrix predictions. A chopping procedure is 
devised in order to produce a statistical ensemble of asset-price covariances from a single instance of 
financial data sets. Local results for the smallest eigenvalue and individual spacings are very stable 
upon reshuffling the time windows and assets. They are in good agreement with the universal 
Tracy- Widom distribution and Wigner surmise, respectively. This suggests a strong degree of 
robustness especially in the low-lying sector of the spectra, most relevant for portfolio selections. 
Conversely, the global spectral density of a single covariance matrix as well as the average over all 
unfolded nearest-neighbour spacing distributions deviate from standard Gaussian random matrix 
predictions. The data are in fair agreement with a recently introduced generalised random matrix 
model, with correlations showing a power-law decay. 



1 Introduction 



'Econophysics', a hybrid of 'economy' and 'physics', has become a very active field of research in the 
past decade. In this paper we will focus on aspects of portfolio selection and market modelling. In 
particular we will analyse cross-correlations in financial time-series and compare to the predictions of 
Random Matrix Theory (RMT). We refer to the reviews [H [2] on this subject, as well as for other 
methods e.g. from field theory to [3J. 

A still debated issue in the community is to what extent the 'historical' determination of covariance 
estimators (i.e. based on past time-series over a finite temporal window T) can be trusted when 
forecasting the financial risk of a certain portfolio; put it differently, how reliably is the past going to 
shape the future? In a pioneering paper, Laloux et al. [4] used a comparison with RMT to cast serious 
doubts on the usefulness of historical covariance spectra in estimating the variance (and thus the future 
risk) of a given portfolio, questioning the widely applied procedure of Markowitz's theory based on 
Gaussian mean-field approximations. The 'measurement noise' due to the finiteness of the historical 
time series T was claimed in [4] to bury most of the relevant information encoded in the historical 
covariance matrices, thus impairing ab ovo much of the consequent predictions. In subsequent papers 
the work of [3] was repeated and refined to other quantities, using Gaussian RMT El [71 [8] , non- 
Gaussian heavy tailed distributions [HI [TUl Ell E21 HHJ HU H5] , or RMT with complex eigenvalues using 
lagged time series [161 [T7j . In addition clever methods were devised to detect meaningful correlations 
buried under the 'noise-dressed' regions of the spectra [13119], thus trying to mitigate the pessimistic 
forecast of [I]. 

A central tool for our comparison with empirical financial data is the so-called Wishart-Laguerre 
(WL) ensemble of random matrices. The WL ensemble contains random matrices of the form [20J 
W = (1/T)X+X, where X is a rectangular matrix of size TxN (T > N), whose entries are independent 
Gaussian variables in the simplest case, and X^ is the Hermitian conjugate of X. As such, the 
WL ensemble contains (positive definite) covariance matrices W of maximally random data sets. 
They have since appeared in many different contexts apart from quantitative finance, ranging from 
mathematical statistics [21] and statistical physics |22] to gauge theories [23], quantum gravity |24j 
and telecommunications [25] . 

By definition, the WL ensemble constitutes a 'null hypothesis', with the highest degree of random- 
ness and lowest degree of built-in information in the data-set, and therefore is an ideal benchmark for 
comparison with the a priori much richer correlations in the financial time series. 

Within the large-iV predictions of WL one has to distinguish between global spectral properties, 
taking into account correlations over a scale of 0(1) much larger than the mean level spacing, and 
local properties, probing correlations on the scale of the mean spacing (typically of 0(1/N)). It is well 
known that the latter are much more robust under small deformations of the Gaussian probability 
distribution of the matrix elements. Typical examples include the distribution of individual eigenvalues 
and individual spacings among eigenvalues in the bulk. 

While the global spectral density has received a good deal of attention in the works quoted above, 
the spacing distribution has so far only been investigated in [3 [6], histogramming the spacings between 
all consecutive eigenvalues after unfolding. One of the aims of this paper is to further investigate the 
local statistics (individual distributions), taking also into account the presence of non-Gaussian heavy 
tails in the spectral density. 

In order to test e.g. if the smallest eigenvalue follows the universal Tracy- Widom distribution [26J, 
as was proven only very recently for WL with real matrix elements [27], we immediately face the 
following problem: a single covariance matrix clearly does not allow for testing individual eigenvalue 
distributions. We are therefore led to define a meaningful way of generating ensembles of covariance 
matrices from a single data set. These are then tested against WL and its generalisation with power- 
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law tails that was proposed in [28] and further developed for local statistics in [13] as well as in this 
paper. 

This article is organised as follows. In section [2] we summarise the relevant predictions from RMT, 
both for the standard WL ensemble and its non-Gaussian generalisation with power-law tails. In 
section [3] we compare these to data from financial covariance matrices. This section is divided into 
two parts. In 13.11 we compare to the data from a single covariance matrix partly repeating previous 
analysis. In the second subsection 13.21 we define ensembles of matrices by chopping the covariance 
matrix, and compare to the distribution of individual eigenvalues and spacings. In section H] we offer 
some conclusions, followed by 3 appendices containing technical details and consistency checks. 

2 Random Matrix Predictions 

In this section we summarise the analytical predictions from RMT, both for the standard Gaussian 
model and its generalisation. We do not give derivations here but rather illustrate these predictions 
by comparing to numerically generated random matrices for both models. 

2.1 Global density and level spacing 

We first introduce the standard Wishart-Laguerre (WL) ensemble of Gaussian random matrices (also 
called chiral Gaussian ensemble). In view of our application to time series, in particular stock price 
fluctuations, we restrict ourselves to real matrix elements denoted by the Dyson index (3 = 1. Its 
probability density distribution of matrix elements is given by 

V W L (X) ~ exp [-cr 2 Tr(X T X)] , (2.1) 

where X is a matrix of size T x N (T > N) with real elements. The integration measure dX. is defined 
by integrating over all independent matrix elements of X with a flat measure. The joint probability 
distribution of the positive definite eigenvalues of W = (1/T)X^X (the so-called Wishart matrix), or 
equivalently the singular values of the matrix X is obtained by integrating over the angular degrees 
of freedom. Since we are only interested in two specific correlation functions in the large- N limit we 
give the results without derivation. 

The global spectral density defined as p(X) = (Tr <5(A — W)}, averaged with respect to (12.11) . In 
the large- N limit it is given by |29j 



p MP (x) = JJx - cX^)(cX + - x) , with x G [cX_,cXJ . (2.2) 

2ttcx 

This is called Marcenko-Pastur law, and we have normalised it to have integral and first moment equal 
to unit}0 (see e.g. figHJ). The endpoints of support are 

X± = (c-3 ± l) 2 , (2.3) 

where both the large- iV and large-T limit is taken with N = cT, keeping < c < 1. Indeed, the 
applications to times series analysis require T to be much larger than the number of stocks N <C T. 

A second quantity of interest is the spacing distribution between consecutive levels in the bulk of the 
spectrum, probing correlations on a local level of order 0(1/N). We make a distinction between global 
and individual nearest-neighbour distributions. The former is obtained by averaging over spacings in 
the bulk after unfolding (see appendix [C]) and is commonly used in comparisons to RMT. The latter 

In previous comparisons to data the inverse variance a of the distribution eq. (|2.1[) was used as a free parameter. 
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is defined as the distribution of the spacing s& between the kth and [k — l)st eigenvalue, for fixed k 
and N (see e.g. [30]): 

(Afe - Afc_i) 

and it provides information about local correlations at a given point in the bulk, without making any 
further assumptions. 

Within RMT the global and individual spacing in the bulk agree, and an excellent approximation 
for both distributions can be obtained by applying the so-called Wigner surmise (WS), based on a 
2x2 matrix calculation. As was explained and checked numerically in [13], the correct surmise for 
WL is obtained from the Wigner-Dyson (or Gaussian) ensembles (see also [5j [B] ) 



7T 



Pwd{s) = -s exp y--s 



7T 



(2.5) 



here for (3=1. It has norm and first moment of unity. Eqs. (|2.2[) and (12.51) are very well known and 
have been compared to financial data in previous works (see e.g. [U [U [31] and [5JE] respectively). 

We now turn to the recent generalisation of WL introduced in [28^113] . It can be derived from a su- 
perstatistical or generalised entropy approach, representing an interpolation between the fully chaotic 
or random statistics of the Gaussian WL, and a regular or integrable behaviour. The generalisation 
of the probability distribution of matrix elements eq. (|2.ip reads 



V a (X) 



1 + 



1 



-(a+^NT+1) 



a + ±NT + 1 



Tr(X J X 



a > 



(2.6) 



where the A-dependence is required by convergence. The resulting global density that generalises the 
Marcenko-Pastur density eq. (|2.2p is reading [28] 



p a (x) 
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(2.7) 



and it displays a power law tail [13] . The density is normalised to unity and has first moment equal to 
one (see fig. U] for illustration). The integral can be computed in terms of a confluent hypergeometric 
series [13] . For large arguments the density decays algebraically as p a (% S> 1) ~ x~ a ~ 2 . For small 
arguments the density is suppressed exponentially, and for a more detailed discussion we refer to [13] . 
The standard WL quantities are recovered when a — > oo. 

In complete analogy with the standard WL ensemble, we can define the spacing distribution for 
the generalised ensemble eq. (|2.6p that generalises eq. (I2.5p . It is based on a Wigner surmise for 
the corresponding generalised WD ensemble (see also [32]). Although this quantity was not derived 
in [13], it follows in analogy to the corresponding quantity in p3] where generalisations of (|2.5p with 
(stretched) exponential tails were considered. We shall therefore be brief and only quote the answer 
for (3=1 relevant here: 



Pa(s) 



TTS 



2a 2 r(a + 1) 



dtt 



a+2 



exp 



-t 



Tit 



(2. 



The norm and first moment are again chosen to be unity. Exploiting the solution of the integral in 
terms of Hypergeometric functions it is easy to see that the local spacing distribution decays with the 
same power as the global density eq. (|2.7p . p a (s S> 1) ~ s~ a ~ 2 . A saddle point evaluation confirms 
that (|2.8h reproduces (j2.5|) as expected when a — > oo. 
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We have verified numerically that the generalised spacing distribution (12, 8|) indeed agrees with 
individual spacings (eq. (|2.4p ) from random matrices from eq. (|2,6p . This is shown in fig. [TJ 

In both ensembles the results given so far, the global density and the global spacing, can be 
tested on a single realization of a random or data matrix. For individual spacings as well as for the 
distribution of individual eigenvalues we need an ensemble of matrices, and the predictions for the 
latter are given in the next subsection. 
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Figure 1: Test of the large- N individual spacing distribution in the generalised WL model eq. (|2.8j) 
(full line) vs. the individual spacing distributions (|2.4|) for k = 4, 6, 8 at N = 8, T = 16 and a = 3, 
from numerically generated random matrices with distribution (|2.6p . The spacing distribution for the 
standard WL Eq. (j2.5[) is added for comparison (pink dashed line). 



2.2 Smallest and largest eigenvalues 

For the Gaussian WL ensemble with real matrix entries, both the distributions of the largest and 
smallest eigenvalue have been rigorously studied by Johnstone [21], and very recently by Feldheim 
and Sodin [27], respectively. In a well defined scaling limit both cases follow a Tracy- Widom (TW) 
[26] distribution iq(s). More precisely, there exist iV-dependent constants a^\b^\a^j\bffi such 
that 

Prob[x min ( max ) <x}= F^x) (2.9) 
where we have introduced the following scaled random variables: 



Amm — /m j V^- iu 7 

b N 



^max 

Xmax = • (2.11) 



The Tracy- Widom distribution Fi(x) defined in eq. (|2.9p is given by 



Fi(x) = exp 



(2.12) 



ds(q(s) + (s — x)q(sY 

where q(s) is the solution to the second Painleve equation (PII) 

q(s)" = sq(s) + 2q(sf (2.13) 
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subject to the boundary condition of approaching the Airy function asymptotically, q(s) ~ Ai(s) for 
s — > co. This is also called the Hastings-McLeod solution of PII. 

As an independent check we have generated real WL matrices and compared the numerical distri- 
bution for the smallest and largest eigenvalues to Fi(x), see figure [2j 




Figure 2: The distribution of the smallest (left) and largest (right) eigenvalues for f3 = 1 WL, after 
centring and the rescalings eqs. (12 . 1QH and (|2.1ip . The full line corresponds to the theoretical curve 
eq. (|2.12p . the points originate from 60000 numerically generated WL matrices of size A = 80 and 
T = 320. To generate efficiently TW densities, see [33] . 



The result for the largest and smallest eigenvalues given so far is valid for the standard, Gaussian WL 
ensemble. Because of the implicit nature of the TW distribution, we have not managed to analytically 
derive the corresponding distribution valid for the generalised ensemble (j2.6|) with power law tails. In 
related models about the generalised WD class it was argued |344 [35] that a transition between TW 
and the Frechet or normal distribution takes place for the largest eigenvalue. 

Here, instead we have numerically produced the distribution of the smallest and largest eigenvalue 
in the generalised ensemble and observed the expected convergence to the WL distribution when a 
gets large, see fig. El Because we do not know the modification of the scaling relation in the generalised 
WL ensembles we show the unsealed data. 



3r 1 1 1 1 1 1 1 1 2 




Figure 3: Distribution of the unsealed smallest (left) and largest (right) eigenvalues for the generalised 
ensemble eq. (|2.6p for the case N = 8, T = 15 and different values of a. For comparison the 
corresponding distribution for the WL ensemble (dash-dotted black line) is given, which after a proper 
rescaling (eqs. ()2.10|) and (|2.1ip ) maps to fig. [2 
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An important observation we make is that the distribution of the largest eigenvalue is strongly modified 
due to the power law tail of the spectrum. In contrast the smallest eigenvalue undergoes only a mild 
modification. This is probably because the exponential suppression of the spectral density at the lower 
"pseudo" edge is not dramatically different from the square root edge in WL. This will be important 
when comparing to data in the next section. 

3 Comparison to Financial Correlation Matrices 

In this section, we compare the RMT predictions from the previous section to data from financial 
covariance matrices. Our motivation with respect to previous works has been two-fold. 

First, can we improve the fit of financial spectra to the global MP density of eigenvalues following 
the null-hypothesis of Gaussian random variable by introducing correlations among matrix elements 
that lead to power-law tails? A similar route has been followed previously in [12] albeit with a different 
model; our model and a first comparison was given in [13] . 

Second, can we test RMT predictions that are more robust under deformations of the Gaussian 
measure of WL than the MP density? The quantities we have looked at are the individual and global 
spacing distribution for standard WL, and the individual distribution of the smallest eigenvalue. The 
individual distributions of course require the definition of a suitable ensemble of homogeneous matrix 
objects to average over. The global spacing distribution was previously compared to data in [5j [6]. 

In the first subsection 13.11 we focus on quantities that can be inferred from a single financial 
covariance matrix, whereas in the second subsection l3.2l we define a new chopping procedure to generate 
ensembles of covariance matrices from one single instance in order to analyse the individual eigenvalue 
statistics. 

3.1 Power-law decay in the global density and the global spacing 

In this subsection we have analysed a single set of data given by N = 401 stocks and a time series of 
daily prices T = 970. The data were obtained from the S&P 500 Equity Index. It is representative of 
the U.S. equity market and hence of their economy including the 500 largest companies. We extracted 
the daily opening prices for the 4-year period 2002-2006. Stocks that did not survive in the index 
during that period were deleted from the analysis, hence the reduced number of 401 stocks. Henceforth 
we will refer to this dataset as set SP. 

The financial covariance matrix is obtained in the standard way [4] which we briefly recall. First 
the normalised return Gi(t) of the i-th stock at time t is computed as the following function of its 
stock price Si{t) 

Gi{t) = \og[Si(t + At)} - log[5i(t)] , (3.1) 

where the time step At is 1 day for our set SP. Then the temporal mean (Gi(t)) and variance <Tj are 
computed for each stock independently to give the normalised data matrix elements 

= am - (G.w) 

Clearly the over the period t = 1, . . . , T have variance one and vanishing mean. Then the eigenval- 
ues of the covariance matrix C = (l/T)X r X of these data are computed and histogrammed in fig HI 
The spectral density obtained in this fashion is normalised to unity and rescaled to have a first moment 
equal to 1. It can then be compared to the MP density (|2.2p which is parameter free under the same 
rescaling, and to the generalised density in (|2.7p . In previous works using the standard Gaussian WL 
ensemble a one-parameter fit was performed using a variance dependent MP density, arguing that the 
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outlying eigenvalues with respect to RMT would effectively reduce the inverse variance a in eq. (|2,1D 
|3j. No such freedom is left in our approach as the first moment has been already normalised to 1. 

The resulting agreement between the generalised density (12. Tf) and the data is very good after 
fitting the only free parameter a ~ 0.95... in the generalised model. This leads to a power-law decay 
with exponent a + 2 ~ 2.95. It has been observed independently in the framework of a similar model 
[12] that the global spectral histogram is better fitted by a density with power-law tails rather than 
MP. Despite many differences between the two models, the exponent 2.93 found in [12] is similar to 
ours, using a very similar set of data (daily returns of from S&P 500 Equity Index from 2003-2007). 




Figure 4: Comparison between part of the rescaled eigenvalue distribution from financial data set 
SP, and the macroscopic density p a {x) from RMT for the generalised model eq. ([2.7]) . in red and 
green respectively. The best fit gives a value of a « 0.95, which corresponds to a power-law decay as 
Pa{x) ~ x -2 ' 95 . For comparison we have added the parameter free MP density eq. (|2.2p . The inset 
shows all eigenvalues on a different scale. 



A second quantity we analyse is the nearest-neighbour spacing distribution which resolves local 
information of the order 0(1/N) and is much more universal, being the Fredholm determinant of the 
universal sine- kernel [36] . For a detailed discussion of universality issues we refer to [13] . 

In this subsection we specifically look at the global level spacing distribution, obtained from a 
single sequence of sorted eigenvalues (one matrix sample). 

After unfolding the spectrum, a standard procedure in RMT [37] where we use a polynomial 
fitting of the cumulative eigenvalue distribution (see appendix [C] for details), the normalised histogram 
of all spacings between consecutive eigenvalues is plotted in fig. [5j Note that we have made an 
important assumption here: we assumed that averaging over the spacings between different consecutive 
eigenvalues of a single covariance matrix converges to a single distribution function after unfolding. 
Whereas in RMT this is known to hold due to the self-averaging property (or ergodicity) this is by no 
means guaranteed for financial covariance matrices. 
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Figure 5: The unfolded global spacing distribution from financial data set SP vs the RMT prediction 
eq. (|2,5p for WL (black right curve), and eq. f|2.8j) for our generalised ensemble: a = 0.95 (red left 
curve) and the best fit with a = 8 (blue middle curve). 

The result in fig. [5] shows that the spacing distribution seems to deviate from the WS of standard 
WL, although our statistics is not extremely good. This is in contrast with previous analysis [5l [6] 
where a good agreement with WS was found 0. For comparison we give our generalised spacing 
distribution eq. (|2.8|) : from the value a = 0.95 as determined in fig. [J] we get a worse fit than from the 
WS. However, using a as a free fitting parameter the apparently best fit is given by the generalised 
spacing for a ~ 8, thus parametrising the deviation from WL. This can be interpreted as the system 
not being fully random, as we have seen from the global spectrum. 

Because our statistics is not very good we will come back to this question at the end of the next 
subsection 13.21 The ensembles from a chopped covariance matrix show the same effect more clearly. 



3.2 Universal individual eigenvalue distributions 

In this section we analyse the distribution of individual eigenvalues and spacings and compare them to 
the universal TW and WS distributions respectively (and their generalisation). In order to generate 
ensembles of covariance matrices we use two different methods here: 

Method 1: We take a long time series T of a small number N of stocks, and then chop T into I 
smaller time windows of equal length t, T = it. This creates an ensemble of t covariance matrices of 
size t x N of the same N stocks, with c = N/t £ (0, 1]. 

Method 2: We take a time series T of N stocks, and chop both N into k sets of n stocks, with 
N = kn, as well as chopping T into t smaller time windows of equal length t, T = it, as before. This 
creates an ensemble of kl covariance matrices of size t x n, thus mixing different sets of n stocks, with 
c = n/te (0, 1]. 

While the chopping in smaller time windows in method 1 (and 2) is unambiguous - apart from 
choosing the length of the time window t - the chopping into smaller subsets of stocks may seem at 
first quite hazardous, mixing in an arbitrary way fairly heterogeneous data. Surprisingly, we find the 
same universal properties from both methods. 

2 In the same paper no significant deviation from MP was found either, in contrast to our fig. [4] 
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To test the consistency of method 2 we have produced ensembles of the same size using several 
choppings and including different subsets of stocks (see appendix [B]) , and we detected a substantial 
robustness of ensemble properties with respect to random reshuffling. A clear advantage of method 2 
is that we can produce a much better statistics. Furthermore we have varied the length on the time 
window t in both method 1 and 2. Here we have taken care to choose t (and n) such that the resulting 
value 1/c remains bounded of the order O(10), in order to avoid the problems discussed in |31j for 
large ratios. 

We begin by using ensembles generated from method 1. This is done with our second data set DJ 
based on the Dow- Jones Index obtained as follows. Daily returns for N = 27 stocks were used in the 
time interval from June 1997 to October 2005, leading to a T = 17109. From this we have created 
an ensemble of 179 (356) correlation matrices of size 96 x 20 (48 x 20). Our choice was motivated by 
having some data with the same c- value as from method 2 below. 



0.15 0.2 



Figure 6: Method 1 using data set DJ and the ensemble 48 x 20. Left plot: superposition of the 
smallest (red circles) and second smallest eigenvalue (blue crosses). Right plot: superposition of the 
second largest (red circles) and largest eigenvalues (blue crosses). 



The full spectrum is given in fig. [TTJ in appendix [A] but we shall focus on the local properties here. 
The result for the unsealed first and second eigenvalue is given in fig. [6] left. The spacing between the 
two (as well as their individual widths) are of the order of the mean level spacing 1/N ~ 0.05. This 
indicates that a comparison to the TW prediction from RMT (and its generalisation) is meaningful. 

For comparison we also give the distribution of the largest and second largest eigenvalue from 
method 1 in fig. [6] right. Obviously the shape and width are very different, and the spacing is much 
larger than 1/N. Already the second largest eigenvalue is within the support of both the MP and 
generalised density, see fig. [TT] of the full spectrum in appendix [Aj From previous considerations 
[1] as well as our previous section [3] one could speculate whether or not the largest eigenvalues are 
non-random, containing information. 

Because in the WL ensembles TW describes both the smallest and the largest eigenvalues we could 
try to compare these predictions with the data. However, there are some hints that such a program 
is essentially ill-defined, due to the absence of a clear-cut separation between eigenvalues following 
RMT statistics and those carrying genuine information. The fact that we find a fit better fit to our 
generalised RMT with power law tails makes this even more difficult. On the other hand it has been 
argued in [18] that information could be extracted from underneath the density close to the right edge, 
using a power mapping method. 



10 



Figure 7: The rescaled smallest eigenvalue from method 1 using data set DJ. Left: ensemble of size 
48 x 20. Right: ensemble of size 96 x 20. 

For these difficulties we have fitted only the smallest eigenvalue with the TW distribution, with 
the result given in fig. [71 As was discussed in the previous subsection 13.11 we only expect minor 
deviations for the smallest eigenvalue in our generalised ensemble eq. (|2.6p . Although we only have a 
few points in the plot, we can at least conclude that the data are consistent with at fit to TW (as well 
as our generalised model). Note that this finding is invariant under doubling the size of the temporal 
windows, fig. [TJ left vs right. 

The TW distribution is well know to be strongly universal within RMT, being invariant under 
non-Gaussian deformations of the probability distribution eq. (|2.ip . by adding higher order terms in 
the exponent. We have therefore established that a trace of this universality is also seen in ensembles 
of financial covariance matrices. 



We can now repeat the same analysis using method 2. Here we reuse the data set SP from the previous 
subsection 13. 11 and chop the 970 x 401 matrix into ensembles of submatrices of various sizes. The full 
densities for submatrix size 48 x 10 and 48 x 20 of respective ensemble sizes 800 and 400, are shown 
in appendix [A] fig. [T2l The situation for the distribution of the first vs second, as well as largest vs 
second largest eigenvalue is similar as in method 1 described above, and we refer to appendix [B] for 
details. The fit to the universal TW distribution using method 2 is shown in fig. [HJ with the same 
conclusion as above. Here we have halved the chopping size in n (stock) direction, as well as doubled 
the time window t, which all lead to consistent fits. 



Figure 8: The rescaled smallest eigenvalue from method 2 using data set SP. Left: ensemble of size 
48 x 10, middle 48 x 20, right: 96 x 20. 
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In appendix [B] we also compared the distribution of the smallest eigenvalues for a fixed sub-matrix 
size 48 x 20, choosing different subsets of 20 stocks out of 400. Because the unsealed first eigenvalues 
lie on top of each other we do not repeat the rescaled fit to TW for these permutations. We have also 
analysed other sub-matrix sizes, and we have subpartitioned the data set DJ using method 2 as well. 
In all cases we obtained approximately the same level of consistency. 

We now move to the second local quantity, the individual spacing distributions for our ensembles 
as described in eq. (|2.4j) . Because of better statistics we restrict ourselves to ensembles generated 
from method 2. Since we are dealing with individual spacings, no unfolding is necessary here. All 
distributions are consistent with the WS eq. (|2.5|) relevant for the unperturbed WL ensembles. We 
do not find any trace here from our generalised ensemble, in spite of its better fit of the full density, 
see fig. CGJ 




Figure 9: The WS eq. (|2.5D vs the distribution of two individual spacings in the middle of the 
spectrum: 5th and 6th spacing of 10 eigenvalues, for ensembles of size 24 x 10 (left) and 48 x 10 (right) 
from method 2. 




Figure 10: The WS eq. (|2,5p vs the global spacings distribution, for ensembles of size 48 x 10 (left) 
and 48 x 20 (right) from method 2. The generalised surmise is displayed with the values a = 7, 8 
extracted from the global density in fig. [12] respectively, and with the best fitted value of a = 9 in 
both cases. 
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Let us return to the question of a deviation from the WS for the global spacing distribution from the 
previous subsection 13.11 Repeating the same analysis by unfolding the chopped data from method 2 
and then averaging over all consecutive spacings we again find a deviation from the WS in the global 
spacing distribution, but with a much clearer signal compared to fig. [5j 

As in fig. [5] we compare the WS and the generalised spacing eq. f|2.8j) . Like before the a-value 
determined from the global spectral density, see fig. Q21 does not give the best fit (although it is not 
as bad as in fig. [5]). Using a as a free fit parameter we can clearly give a much better fit than the 
WS. This may again be seen as an indication that the data are not fully random. 

4 Conclusions 

We have studied the spectral properties of empirical covariance matrices from financial data, by 
comparing to the predictions of two different Random Matrix ensembles with uncorrelated and power- 
law correlated random variables. While previous works have mainly focused on global properties of 
the spectrum we were interested in local properties such as the distribution of individual eigenvalues or 
the individual spacings between eigenvalues. The reason for investigating this matter was the strong 
degree of universality of these quantities within RMT. These individual quantities can be looked at 
without assuming self-averaging or ergodicity, which is apriori true only within RMT. 

This question automatically drove us to define ensembles of empirical covariance matrices starting 
from a fixed set of time series of different stock prices. Two methods were devised to generate such 
ensembles, either by averaging over just different time windows, or also over different sets of stocks. 
Within both approaches we found that our results for individual quantities are compatible with the 
standard Wishart-Laguerre (WL) ensembles of RMT, starting from the null-hypothesis of Gaussian 
random variables for the matrix elements; the distribution of the smallest eigenvalue in our ensembles 
upon centring and rescaling agrees with the universal Tracy- Widom distribution, and the spacing 
between the kth and (k + l)st eigenvalue for a fixed k in the bulk of the spectrum follows the Wigner 
Surmise (WS). 

The reason for comparing with two different ensembles was motivated by previous findings for the 
global spectrum; its fit to the Marcenko-Pastur (MP) density from Gaussian WL, which is known to 
be only weakly universal, could be improved introducing correlations among matrix elements that lead 
to a power-law decay. Such a deformed RMT was introduced by several authors and can be based 
on a deformed entropy or superstatistical approach, tailored to describe non-equilibrium or not fully 
random problems. 

The question was whether or not this power-law determined from an excellent one-parameter fit to 
the global density would also show up in the local statistics of the data. While we failed to detect this in 
the individual distributions - we expected only a small deformation for the smallest eigenvalue - we did 
see such a deviation in the global spacing distribution, where the average is taken over all consecutive 
spacings after unfolding. This was at least qualitatively the case for both a single covariance matrix 
and the ensemble of such matrices produced from method 2. The corresponding generalised WS was 
derived and numerically tested within the generalised RMT ensemble with power-law tails. 

The appearance of power-laws in the global spectrum on one hand, and more robust universal local 
RMT correlations on the other hand are quite reminiscent of findings in complex networks. Also here 
first investigations gave an agreement with the WS and spectral rigidity [38], whereas recent results 
for the latter follow a generalised RMT |39j . 

We conclude by mentioning some open problems. The distribution of individual eigenvalues in the 
bulk is also known in principle. However, its implicit, almost Gaussian form together with our limited 
statistics makes it harder to detect than the skewed TW distribution. 
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Finally we could ask ourselves how much the chopping procedures introduced to study individual 
properties wash out of the relevant information. Clearly the small matrix size that comes with it does 
not allow for many outlying eigenvalues. At least our method 1 seems to be safe, being merely a time 
average over correlations of the same set of stocks. Another potential drawback is that the chopping 
procedure is clearly not suitable for abrupt changes in the market, which would be diluted in averaging 
over time windows. Overall it is very interesting to see how well a simple one-parameter power-law 
RMT can describe deviations from the standard null-hypothesis of no correlations, both for the global 
density and global spacing distribution. 

Acknowledgements: Financial support by European Community Network ENRAGE MRTN-CT-2004- 
005616 (G.A.) is gratefully acknowledged. We would like to thank Richard Hawkes for sharing his 
data with us. 

A Global Densities of Ensembles of Covariance Matrices vs RMT 

In this section we display the global density of three different ensembles of covariance matrices gen- 
erated with method 1 and 2. The purpose is twofold. First we confirm that the generalised model 
eq. (|2.6p gives a better fit to the global density than the standard WL eq. (|2.ip . In this way we can 
determine the power a for the decay, that could be used in a comparison to local statistics. 

Second, we can see a qualitative difference from the global density of a single covariance matrix, fig. 
[4] in section [3l and fig. [TT] here: there are far fewer outliers here than there. In fact all points outside 
the (crude) MP fit in fig. [11] are due to the single largest eigenvalue of various ensemble members, see 
fig. [6] right. What could be reason for this? Of course the matrix sizes differ considerably, N = 401 
in the unchopped set SP vs N = 20 eigenvalues in set DJ using method 1. We cannot exclude the 
possibility that the chopping method reduces or washes out relevant information about correlations. 
In fact using method 2 we could somewhat expect that averaging over submatrices from different 
stocks will reduce the information content. 



B Consistency of Chopping in Method 2: Permutations 

In this appendix we check the effect of taking different choppings in method 2 by comparing different 
ensembles when randomly permuting the rows (stocks) into different groups. We have created 10 
different ensembles of the same size 48 x 20 from data set SP. For each ensemble the 400 stocks were 
put into a different partition of 20 blocks of size 20. 

The superposition of the smallest and second smallest eigenvalues from these 10 different ensembles 
generated by method 2 is shown in fig. [131 We find that both eigenvalues superimpose well to give a 
smoothed curve. For illustration we display the curves of the two eigenvalues averaged over these 10 
ensembles in fig. [13] right, to illustrate their width and separation (compared to a single ensemble in 

fig. ED- 

For curiosity we have done the same analysis for the largest and second largest eigenvalues within 
the same 10 ensembles as shown in fig. Q3J 
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Figure 11: Comparison between the full spectral density from an ensemble generated using method 1 
on set DJ (48 x 20 chopped ensemble). The generalised density eq. (|2.7[) fitted with a = 1, and the 
MP distribution eq. (|2,2|) are given by the full green line and dashed blue line, respectively. 

As we can see in fig. [TH the largest eigenvalues from the 10 different ensembles overlap smoothly. 
This fact makes its unlikely that in these ensembles the largest eigenvalues still represents the market 
movement as in [5j [6]. It would be very interesting to try to identify the largest eigenvalues in fig. 
[T4l as a largest generalised TW eigenvalue from RMT as shown in fig [3] right. However, due to the 
lack of an analytic formula within our generalised RMT, and in particular because of the (possibly 
o-dependent) scaling with N this is a difficult task. 




Figure 12: Comparison between part of the global spectral density from ensembles using method 2 
on data set SP: the 48 x 10 chopped ensemble (left) and 48 x 20 (right) vs the generalised density eq. 
(|2.7p (full red line) for a = 7, 8 respectively. The MP distribution eq. (| 2 . 2 j) (dashed line) gives a worse 
fit. 
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Figure 13: Consistency of the 2 smallest eigenvalues in method 2: 10 different choppings of the same 
size 48 x 20 are obtained by using different partitions of the 400 stocks into 20 blocks of size 20: 
the overlapping of the smallest (left) and second smallest eigenvalue (middle) for these 10 different 
ensembles. Right plot: after taking the average over the 10 ensembles the averaged smallest (red left 
curve) and averaged second smallest eigenvalue (right dashed curve) are superimposed. 




Figure 14: Consistency of the 2 largest eigenvalues in method 2 using the same method as in fig. [TBI 
the overlapping the second largest (left) and largest eigenvalue (middle) from 10 different ensembles. 
Right plot: after taking the average over the 10 ensembles the averaged second largest (black left 
curve) and averaged largest eigenvalue (right dashed curve) are superimposed. 

C Unfolding Routine 

Consider the cumulative distribution of eigenvalues: 

P{x) = N I dx'p{x') (C.l) 
Jo 

where p(A) is the smooth density of eigenvalues (normalised to 1). 

Given a set of m samples of N bare eigenvalues {A^\ . . . , A^} (with k = 1, . . . , m), we first define 
a set of unfolded eigenvalues {E[ k \ . . . , Ej§ } as: 

Ef ] = P(\f y ) (C.2) 

Then the set of spacings {s} to be histogrammed is computed by the nearest-neighbour difference 
among the E's within each sample. Clearly, for empirical data the main problem is to estimate 
reliably the cumulative distribution P{x). Given a regular grid of K points {0 < y\ < ... < yx} 
on the positive semi-axis, one can simply define an estimator P{y r ) for P(x) as the number of bare 
eigenvalues (in total mN) falling below y r , divided by m. This estimator approaches N for large 
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argument y. Then, in order to get a continuous distribution, one simply performs a polynomial fit of 
degree d on the {y r , P(y r )} pairs and approximates the true cumulative distribution with the fitting 
polynomial Pd(x) ~ P(x). 

The MATLAB code that performs the unfolding procedure as stated above and returns the posi- 
tions X of the bins and the normalised histogram Y of the nearest neighbour spacings can be retrieved 
from the public domain web site 
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